Approximations to Stochastic Dynamic Programs via Information Relaxation Duality

نویسندگان

  • Santiago R. Balseiro
  • David B. Brown
چکیده

In the analysis of complex stochastic dynamic programs (DPs), we often seek strong theoretical guarantees on the suboptimality of heuristic policies: a common technique for obtaining such guarantees is perfect information analysis. This approach provides bounds on the performance of an optimal policy by considering a decision maker who has access to the outcomes of all future uncertainties before making decisions, i.e., fully relaxed non-anticipativity constraints. A limitation of this approach is that in many problems perfect information conveys excessive power to the decision maker, which leads to weak bounds. In this paper we leverage the information relaxation duality approach of Brown, Smith, and Sun (2010) to show that by including a penalty that punishes violations of these non-anticipativity constraints, we can derive stronger bounds and analytically characterize the suboptimality of heuristic policies in stochastic dynamic programs that are too difficult to solve. We study three challenging problems: stochastic scheduling on parallel machines, a stochastic knapsack problem, and a stochastic project completion problem. For each problem, we use this approach to derive analytical bounds on the suboptimality gap of a simple policy. In each case, these bounds imply asymptotic optimality of the policy for a particular scaling that renders the problem increasingly difficult to solve. As we discuss, the penalty is crucial for obtaining good bounds, and must be chosen carefully in order to link the bounds to the performance of the policy in question. Finally, for the stochastic knapsack and stochastic project completion problems, we find in numerical examples that this approach performs strikingly well. Subject classifications: Dynamic programming, information relaxation duality, asymptotic optimality, stochastic scheduling, stochastic knapsack problems, stochastic project completion.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information Relaxations, Duality, and Convex Stochastic Dynamic Programs

We consider the information relaxation approach for calculating performance bounds for stochastic dynamic programs (DPs). This approach generates performance bounds by solving problems with relaxed nonanticipativity constraints and a penalty that punishes violations of these nonanticipativity constraints. In this paper, we study DPs that have a convex structure and consider gradient penalties t...

متن کامل

Optimal Power Generation under Uncertainty via Stochastic Programming

A power generation system comprising thermal and pumped storage hy dro plants is considered Two kinds of models for the cost optimal generation of electric power under uncertain load are introduced i a dynamic model for the short term operation and ii a power production planning model In both cases the presence of stochastic data in the optimization model leads to multi stage and two stage stoc...

متن کامل

AFRL-AFOSR-VA-TR-2017-0028 Dynamic Decision Making under Uncertainty and Partial Information

The researchers made significant progress in all of the proposed research areas. The first major task in the proposal involved duality in stochastic control and optimal stopping. In support of this task, the researchers developed new methods for efficiently solving optimal stopping problems of partially observable Markov processes and optimal stopping problems under jump-diffusion processes. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016